SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

APPLICANT: LI, ET AL . 

TITLE OF INVENTION: Human G- Protein Coupled 

Receptors 

NUMBER OF SEQUENCES: 26 
CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

CECCHI, STEWART & OLSTEIN 

(B) STREET: 6 BECKER FARM ROAD 

(C) CITY: ROSELAND 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 

(F) ZIP: 07068 

COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 INCH DISKETTE 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: Concurrently 

(C) CLASSIFICATION: 

PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: PCT/US95/04079 

(B) FILING DATE: 30 MAR 1995 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: FERRARO, GREGORY D. 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE/DOCKET NUMBER: 325800-379 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1713 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 
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(i) 
(ii) 

(iii) 
(iv) 



(V) 



(vi) 



(vii) 



(ii) 



MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGCACGAGGT CATTCAACAT TTATTCAACC AAAAATACTA AGTCAGCTCT ATACAAACTA 60 

ATGGAAGGAT ACAGCTATGC AAATATAGAA CACTAAAGTG TTACATGACA GATGTATGAG 120 

TAGTGAAATG GTGAAAAATC AGACAATGGT CACAGAGTTC CTCCTACTG6 GATTTCTCCT 180 

GGGCCCAAGG ATTCAGATGC TCCTCTTTGG GCTCTTCTCC CTGTTCTATG TCTTCACCCT 240 

GCTGGGGAAT GGGACCATCC TGGGGCTCAT CTCACTGGAC TCCAGACTCC ACACCCCCAT 300 

GTACTTCTTC CTCTCACACC TGGCCGTCGT CAACATCGCC TATGCCTGCA ACACAGTGCC 360 

CCAGATGCTG GTGAACCTCC TGCATCCAGC CAAGCCCATC TCCTTTGCTG GTTGCATGAC 42 0 

ACTAGACTTT CTCTTTTTGA GTTTTGCACA TACTGAATGC CTCCTGTTGG TGCTGATGTC 480 

CTACGATCGG TACGTGGCCA TCTGCCACCC TCTCCGATAT TTCATCATCA TGACCTGGAA 54 0 

AGTCTGCATC ACTCTGGGCA TCACTTCCTG GACATGTGGC TCCCTCCTGG CTATGGTCCA 600 

TGTGAGCCTC ATCCTAAGAC TGCCCTTTTG TGGGCCTCGT GAAATCAACC ACTTCTTCTG 660 

TGAAATCCTG TCTGTCCTCA GGCTGGCCTG TGCTGATACC TGGCTCAACC AGGTGGTCAT 72 0 

CTTTGAAGCC TGCATGTTCA TCCTGGTGGG ACCACTCTGC CTGGTGCTGG TCTCCTACTC 780 

ACACATCCTG GGGGGCATCC TGAGGATCCA GTCTGGGGAG GGCCGCAGAA AGGCCTTCTC 840 

CACCTGCTCC TCCCACCTCT GCGTAGTGGG ACTCTTCTTT GGSAGCGCCA TCGTCATGTA 900 

CATGGCCCCT AAGTCCCGCC ATCCTGAGGA GCAGCAGAAG GTCCTTTTTC TTATTTTACA 960 

GTTCCTTTCA ACCCCGATGC TTAAACCCCC TGATTTACAA CCCTGAGGAA TGTAGAGGGT 1020 

CAAGGGTGCC CTCCGAGGAG ACCACTGTGC AARGRAAGTC ATTCCTAAGG GGTGTGACAT 1080 

TTGAACTGCC AGCCCCAGTT GCCCCGTGGA CTCCTGATGC CCAATTATTG CCTCAACCCA 1140 

il-i GAAAAGTTTA CTCCCCTTTA ACTGTGCTTT ACTGACAGAA GGGCAAGCCT TCTCCCGTTT 1200 

TTTGCAGATA AAATTTTAGA TGTGTTGCAA TCATTGGGTT TCTAGGAGAT GTGGTTTTAT 1260 

CAGACAATTT TTTCTTTTAT TTCACAATTA CTTTAATATC TGTAAAATAA AGAATTATTT 1320 

C3 TAAATCATTT TCCCAGTCCC AAAAGTTAAA TACAGGCCAC TTACTTCTTT AACCAAATGA 1380 

fij TATAGTTTGG CTCTGTGTCC CCACCCAAAT CTCATGTCAA ATTGTAATCC CCGCATGTCA 1440 

GCGGAGGGAC CTGGTGGGAG 6TGATTGGAT CATGGGGAGG GATTTCCCCC TTGCTGTTCT 1500 

'C GTTGATAGTG AACGAGTTCT CACGAAATCT GATGGTTTAA AAGTGCAGCA CTTCTCCCTT 1560 

TGCTCTCTCT CTCCTGCTGT GCCATGGTAA GACGTGCCTT GCTTCCCCTG GTGCTTCCGC 1620 

M-l CATGATTGTA CCTTTCCTGA GGCCTCTCCA GCCATGTGGA ACTGTGAGCC AATTAAACTT 1680 

CTTTTCTTTA GAAA2VAAAAA AAAAAAAAAA AAA 1713 

(2) INFORMATION FOR SEQ ID NO:2: 
hi- (1) SEQUENCE CHARACTERISTICS 

r1 (A) LENGTH: 296 AMINO ACIDS 

If (B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 
f'f (D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 



(xi) SEQUENCE DESCRIPTION: 



Met 


Ser 


Ser 


Glu 


Met 
5 


Val 


Lys 


Leu 


Leu 


Leu 


Gly 


Phe 


Leu 


Leu 










20 






Phe 


Gly 


Leu 


Phe 


Ser 


Leu 


Phe 








35 






Gly 


Thr 


He 


Leu 


Gly 


Leu 


He 








50 






Pro 


Met 


Tyr 


Phe 


Phe 


Leu 


Ser 










65 






Tyr 


Ala 


Cys 


Asn 


Thr 


Val 


Pro 










80 






Pro 


Ala 


Lys 


Pro 


He 


Ser 


Phe 



SEQ ID 


NO: 


2 : 










Asn 


Gin 


Thr 


Met 


Val 


Thr 


Glu 


Phe 






10 










15 


Gly 


Pro 


Arg 


He 


Gin 


Met 


Leu 


Leu 




25 










30 


Tyr 


Val 


Phe 


Thr 


Leu 


Leu 


Gly Asn 




40 










45 


Ser 


Leu 


Asp 


Ser Arg 


Leu 


His 


Thr 






55 










60 


His 


Leu 


Ala 


Val 


Val 


Asn 


He 


Ala 






70 










75 


Gin 


Met 


Leu 


Val 


Asn 


Leu 


Leu 


His 






85 










90 


Ala 


Gly 


Cys 


Met 


Thr 


Leu 


Asp 


Phe 
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Leu. 




Leu 


Ser 


110 


AT a 


Hi <5 


jyiec 


Ser 


Tyr Asp 






V CLX 
















file 


He 


He 


Met 


L Xli 
X *± VJ 


irp 




Seir 


Trp 


Thr 


Cys 


155 






Xxe 


Leu Arg 


Leu 


Jr X O 














170 






Pne 


Cys 


Glu 


He 






V dx 








185 






Tirp 


Leu 


Asn 


Gin 


Va.X 


V ciX 


X xc 








o n n 

^ U VJ 






Val 


Gly 


Pro 


Leu 


v_ys 
215 


Leu 




Gly 


Gly 


He 


Leu 


Arg 

230 


xxe 


V7XI1 


Phe 


Ser 


Thr 


Cys 


Ser 
245 


Ser 


His 


Gly 


Ser 


Ala 


He 


Val 


Met 


Tyr 








260 






Glu 


Glu 


Gin 


Gin 


Lys 
275 


Val 


Leu 


Thr 


Pro 


Met 


Leu 


Lys 


Pro 


Pro 



290 







X u u 










X VJ D 


Thr 


Glu 


Cys 

XXO 


Leu 


Leu 


Leu 


Val 


Leu 

J. ^ VJ 


Ala 


He 


Cys 

Xo u 


His 


Pro 


Leu 


Arg 


Tyr 

x^ o 


Val 


Cys 


He 

14i3 


Thr 


Leu 


Gly 


He 


Thr 

X3U 


Leu 


Ala 


Met 

Xb U 


Val 


His 


Val 


Ser 


Leu 

net; 

X D D 


Gly 


Pro 


Arg 

X /o 


Glu 


X xe 


Asn 


nxo 


XOU 


Leu 


Arg 


Leu 

1 y u 


Ala 


Cys 


Ala 


Asp 


Thr 
xy D 


Phe 


Glu 


Ala 

O A C 


Cys 


Met 


Phe 


He 


Leu 
^xu 


Leu 


Val 


Ser 


Tyr 


Ser 


His 


He 


Leu 






**> A 

220 








o o c 


Ser 


Gly 


Glu 

o o c 


Gly 


Arg 


Arg 


Lys 


Ala 

O A n 


Leu 


Cys 


Val 

o ^ 

25 0 


Val 


Gly 


Leu 


Phe 


Phe 

b b 


Met 


Ala 


Pro 
265 


Lys 


Ser 


Arg 


His 


Pro 
270 


Phe 


Leu 


He 

280 


Leu 


Gin 


Phe 


Leu 


Ser 
285 


Asp 


Leu 


Gin 
295 


Pro 











(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHAELACTERISTICS 

(A) LENGTH: 2185 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TCACTATAGG GCGAATTGGG TACGGGCCCC CCCTCGAGGT CGACGGTATC GATAAGCTTG 60 

ATATCGAATT CGGCACGAGC CGGGCTCGGA GAGGTGACGG AACCGGGGCT GGTAGCATAG 120 

TTTGATTTGA TGATGGAGCC AACACAGGGG TTGGAGCTGG TACCGGTGAA GCTGAGGCTA 180 

AAAAGGTTCC TGGAGTAGAC GATGGAGCCA TAACTGGAAC CGGAGTCTGT GAATGAAGCC 240 

AGGACAGGAG CAGCACCTGG CGATGGTGCC AGGACCGGAA GAGGAGCCAG AGGAGGAGCT 300 

GGAGAAGGAG CCAGAATTGC TGTCTGTGGA GCCGCCATAG GAGCCAGAGG GGTGGCTAGA 360 

GCCTGAGAAT GCAGAAGATG CTGGAGCCAG AAGGGAAGCC TGA.GCTGGAG CTGGATTTGG 420 

TGCTGACGGA AAAGGACTGG CCAGAGCCGA AGCTGGCACC AGGGACAGGT GAGCATTCTG 480 

GGGCCACGGT TGAGTTCAAC CCACTGACTT CAGGTGAAGG ACTGTGGACC AGCTTGAGAA 54 0 

GAGGCCTCAC CAGAGTGGGT GTGGGGCATG GGGGCTCGAG CAGTACCCAG AGTAGGTGTG 600 

GGTAGCCCGG CCAGGGGTTA ACGTGGGGCG TGGATTCAAC ACAGCTTGGA AGCCCAGAGC 66 0 

TCGGAGGCCC GGGTGCTTGG GCCAATTGAG GAACAGGAGT CAGTCCATCC CGAGGGGGTT 72 0 

GTCTCACTAC AATCTTCACA CGCCTTTATT ATTCACCATG GTTGGTGGCA CCTGGTTAGC 780 

AGCAAGCGGA AGGCTGAGGC CAGTAGGGGC AGGGGTGTTA CTGGGGGTCG AAGAAGCCAG 84 0 

CACAGAGACA GGGGTAGGGC CAGGGGTCGG GGCCACGGCC TGGATGAGGC CCACATGGGC 900 

AGGCTGGCTG ATGAGATGGT GCTGCCCCCC TGCTGACACG AGGTGCACCA CATTCCTTTG 960 
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CAGCGGGCGG GCTGCCCCAC AGCAAGCTGG CGCACCTGGG CACCATCCAA AATACAGCTT 1020 

GTTTCCCTGG ATTTGGAAGG TGAGAGGTTT GCTTCCCCCT CCATTAACCA CTGACGTTGT 108 0 

GCCAGTGAGA CTAACTCTCC GCGCCAATCT GTCCGCGGCT GACCTCCTTC GCGGGCGTGG 1140 

CCTACCTCTT CCTCATGTTC CACACTGTCC CCGCACAGCC CGACTTTCAC TTGAGGGCTG 1200 

GTTCCTGCGG CAGGGCTTGC TGGACACAAA CCTCACTGCG TCGGTGGCCA CACTGCTGGC 1260 

CATCGCCGTG GAGCGGCACC GCAGTGTGAT GGCCGTGCAG CTGCACAGCC GCCTGCCCCG 132 0 

TGGCCGCGTG GTCATGCTCA TTGTGGGCGT GTGGGTGGCT GCCCTGGGCC TGGGGCTGCT 1380 

GCCTGCCCAC TCCTGGCACT GCCTCTGTGC CCTGGACCGC TCCTCACGCA TGGCACCCCT 144 0 

GCTCAGCCGC TCCTATTTGG CCGTCTGGGC TCTGTCGAGC CTGCTTGTCT TCCTGCTCAT 150 0 

GGTGGCTGTG TACACCCGCA TTTTCTTCTA CGTGCGGCGG CGAGTGCAGC GCATGGCAGA 1560 

GCATGTCA6C TGCCACCCCC GCTACCGAGA GACCACGCTC AGCCTGGTCA AGACTGTTGT 1620 

CATCATCCTG GGGGCGTTCG TGGTCTGCTG GACACCAGGC CAGGTGGTAC TGCTCCTGGA 1680 

TGGTTTAGGC TGTGAGTCCT GCAATGTCCT GGCGTTAGAA AAGTACTTCC TACTGTTGGC 174 0 

CGAGCCAACC TCACTGGTCA ATGCTGCTGT GTACTCTTGC CGAGATGCTG AGATGCGCCG 1800 

CACCTTCCGC CGCCTTCTCC TGCTGCGCGT GCCTCCGCCA GTCCACCCGC GAGTCTGTCC 1860 

ACTATACATC CTCTGCCCAG GGAGGTGCCA GCACTCGCAT CATGCTTCCC 6AGAACGGCC 1920 

ACCCACTGAT GGACTCCACC CTTTAGCTAC CTTGAACTAC AGCGGTACGC GGCAAGCAAC 1980 

AAATCCACAG CCCCTGATGA CTTGTGGGTG CTCCTGGCTC AACCCAACCT CGTGCCGAAT 2040 

TCCTGCAGCC CGGGGGATCC ACTAGTTCTA GAGCGGCGCC ACCGCGGTGG AGCTCCAGCT 2100 

TTTGTTCCCT TTAGTGAGGG TTAATTTCGA GCTTGGC6TA ATCATGGTCA TAGCTGTTTC 2160 

CTGTGTGAAA TTGTTATCCG CTCAC 2185 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 393 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met 


Arg 


Pro 


Thr 


Trp 


Ala 


Gly 


Trp 


Leu 


Met 


Arg Trp 


Cys 


Cys 


Pro 








5 










10 








15 


Pro 


Ala 


Asp 


Thr 


Arg 


Cys 


Thr 


Thr 


Phe 


Leu 


Cys Ser Gly Arg 


Ala 








20 










25 








30 


Ala 


Pro 


Gin 


Gin 


Ala 


Gly 


Ala 


Pro 


Gly His 


His Pro 


Lys 


Tyr 


Ser 










35 










40 








45 


Leu 


Phe 


Pro 


Trp 


lie 


Trp 


Lys 


Val 


Arg 


Gly 


Leu Leu 


Pro 


Pro 


Pro 










50 










55 








60 


Leu 


Thr 


Thr 


Asp 


Val 


Val 


Pro 


Val 


Arg 


Leu 


Thr Leu 


Arg 


Ala 


Asn 








65 










70 








75 


Leu 


Ser 


Ala 


Ala 


Asp 


Leu 


Leu 


Arg 


Gly Arg 


Gly Leu 


Pro 


Leu 


Pro 










80 










85 








90 


His 


Val 


Pro 


His 


Cys 


Pro 


Arg 


Thr 


Ala 


Arg 


Leu Ser 


Leu 


Glu 


Gly 










95 










100 








105 


Trp 


Phe 


Leu 


Arg 


Gin 


Gly 


Leu 


Leu 


Asp 


Thr 


Asn Leu 


Thr 


Ala 


Ser 








110 










115 








120 


Val 


Ala 


Thr 


Leu 


Leu 


Ala 


He 


Ala 


Val 


Glu 


Arg His 


Arg 


Ser 


Val 










125 










130 








135 


Met 


Ala 


Val 


Gin 


Leu 


His 


Ser 


Arg 


Leu 


Pro 


Arg Gly Arg Val 


Val 










140 










145 








150 


Met 


Leu 


lie 


Val 


Gly 


Val 


Trp 


Val 


Ala 


Ala 


Leu Gly Leu Gly 


Leu 










155 










160 








165 


Leu 


Pro 


Ala 


His 


Ser 


Trp 


His 


Cys 


Leu 


Cys 


Ala Leu 


Asp Arg 


Ser 










170 










175 








180 
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Ser Arg Met Ala Pro Leu Leu Ser Arg Ser Tyr Leu Ala Val Trp 

185 190 195 

Ala Leu Ser Ser Leu Leu Val Phe Leu Leu Met Val Ala Val Tyr 

200 205 210 

Thr Arg lie Phe Phe Tyr Val Arg Arg Arg Val Gin Arg Met Ala 

215 220 225 

Glu His Val Ser Cys His Pro Arg Tyr Arg Glu Thr Thr Leu Ser 

230 235 240 

Leu Val Lys Thr Val Val lie lie Leu Gly Ala Phe Val Val Cys 

245 250 255 

Trp Thr Pro Gly Gin Val Val Leu Leu Leu Asp Gly Leu Gly Cys 

260 265 270 

Glu Ser Cys Asn Val Leu Ala Leu Glu Lys Tyr Phe Leu Leu Leu 

275 280 285 

Ala Glu Pro Thr Ser Leu Val Asn Ala Ala Val Tyr Ser Cys Arg 

290 295 300 

Asp Ala Glu Met Arg Arg Thr Phe Arg Arg Leu Leu Leu Leu Arg 

305 310 315 

Val Pro Pro Pro Val His Pro Arg Val Cys Pro Leu Tyr lie Leu 

320 325 330 

Cvs Pro Gly Arg Cys Gin His Ser His His Ala Ser Arg Glu Arg 

335 340 345 

Pro Pro Thr Asp Gly Leu His Pro Leu Ala Thr Leu Asn Tyr Ser 

350 355 360 

Glv Thr Arg Gin Ala Thr Asn Pro Gin Pro Leu Met Thr Cys Gly 

365 370 375 

Cvs Ser Trp Leu Asn Pro Thr Ser Cys Arg lie Pro Ala Ala Arg 

380 385 390 

Gly He His 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1474 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: 

CGGCACGAGC ATAAGAAGAC AGAGAGAACT 
AATGAACACC ACAGTAATGC AAGGCTTGAA 
GATAGTACAG CTGGTATTCC CAGCCCTCTA 
GAATACTTTG GCTCTGTGGG TGTTTGTTCA 
CCTCAAAAAC ACTTTGGTGG CCGACTTGAT 
CTCTGACTCA CACCTGGCAC CCTGGCAGCT 
GATATTTTAT GAGACCATGT ATGTGGGCAT 
ATTCCTCAAG ATCATCAGAC CTTTGAGAAA 
AACGGTCTCA ATCTTCATCT GGTTCTTTTG 
GAGCAACAAG GAAGCAACAC CATCGTCTGT 
GGGGCTGAAA TGGCATCAAA TGGTAAATAA 
TATCCTAATG CTTGTGTTTT ATGTGGTTAT 
TCCAAAAGTA AGGACAGAAA AAACAACAAA 



SEQ ID NO: 5: 

GAGTATCCTC CCAAAGGTGA CACTGGAAGC 60 

CAGATCTAAG CGGTGCCCCA AAGACACTCG 120 

CACAGTGGTT TTCTTGACCG CAATCCTGCT 18 0 

CATCCCCAGC TGGTCCACCT TCATCATCTA 24 0 

AATGACAGTG ATGCTTCCTT TCAAAATCCT 300 

CAGAGCTTTT GTGTGTCGTT TTTCTTCGGT 360 

AGTGCTGTTA GGGCTCATAG CCTTTGACAG 420 

TATTTTTCTA AAAAAACCTG TTTGGGGAAA 480 

GTTCTTCATC TCCCTGCCAA ATATGATCTT 540 

GAAAAAGTGT GCTTCCTTAA AGGGGCCTCT 60 0 

CATATGCCAG TTTATTTTCT GGACTGTTTT 66 0 

TGCAAAAAAG TATATGATTC TTATAGAAAG 720 

AAGCTGGAAG GCAAAGTATT TGTTGTCGTG 780 
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GCTGTCTTCT TTGTGTGTTT TGCTCCATTT CATTTCGCCA GAGTTCCATA TACTCACAGT 
CAAACCAACA ATAAGACTGA CTGTAGACTG CAAAATCAAC TGTTTATTGC TAAAGAAACA 
ACTCTCTTTT TGGCAGCAAC TAACATTTGT ATGGATCCCT TAATATACAT ATTCTTATGT 
AAAAAATTCA CAGAAAAGCT ACCATGTATG CAAGGGAGAA AGACCACAGC ATCAAGCCAA 
GAAAATCATA GCAGTCAGAC AGACAACATA ACCTTAGGCT GACAACTGTA CATAGGGGTA 
ACTTCTATTT ATTGATGAGA CTTCCGTAGA TAATGTGGAA ATCCAATTTA ACCAAGAAAA 
AAAGATTGGG GCAAATGCTC TCTTACATTT TATTATCCTG GTGTACAGAA AAGATTATAT 
AAAATTTAAA TCCACATAGA TCTATTCATA AGCTGAATGA ACCATTACTA AGAGAATGCA 
ACAGGATACA AATGGCCACT AGAGGTCATT ATTTGTTTCT TTCTTTCTTT ttTTTTTTTT 
AATTTCaAGA GCATTTCACT TTAACATTTT GGAAAAGACT AAGGAGAAAC GTATATCCCT 
ACAAACCTCC CCTCCAAACA CCTTCTTACA TTCTTTTCCA CAATTCACAT AACACTACTG 
CTTTTGTGCC CCTTAAATGT AGATTTGTTG GCTG 

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 293 AMINO ACIDS 

(B) TYPE: AMINO ACID 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: PROTEIN 



(xi) SEQUENCE DESCRIPTION: 


SEQ ID 


NO: 6 : 










lYiei, 


Asn 


Thr 


Thr 


Val 


Met 


Gin 


Glv 


Phe 


Asn 


Arg 


Ser 


Lys 


Arg 


Cys 








5 










10 










15 


Pro 


Lvs 


Asp 


Thr 


Arg 


He 


Val 


Gin 


Leu 


Val 


Phe 


Pro 


Ala 


Leu 


Tyr 






20 










25 










30 


Thr 


Val 


Val 


Phe 


Leu 


Thr 


Gly 


He 


Leu 


Leu 


Asn 




J-l w u 


Ala 


Leu 










35 








40 










45 


Trp 


Val 


Phe 


Val 


His 


He 


Pro 


Ser 


Ser 


Ser 


Thr 


Phe 


He 


He 


Tyr 








50 










55 










60 


Leu 


Lys 


Asn 


Thr 


Leu 


Val 


Ala 


Asp 


Leu 


He 


Met 


Thr 


Leu 


Met 


Leu 








65 










70 










75 


Pro 


Phe 


Lys 


He 


Leu 


Ser 


Asp 


Ser 


His 


Leu 


Ala 


Pro 


Trp 


Gin 


Leu 








80 










85 










90 


Arg 


Ala 


Phe 


Val 


Cys 


Arg 


Phe 


Ser 


Ser 


Val 


He 


Phe 


Tyr 


Glu 


Thr 








95 










100 










105 


Met 


Tyr 


Val 


Gly 


He 


Val 


Leu 


Leu 


Gly 


Leu 


He 


Ala 


Phe 


Asp 


Arg 






110 










115 










120 


Phe 


Leu 


Lys 


He 


He 


Arg 


Pro 


Leu 


Arg 


Asn 


He 


Phe 


Leu 


Lys 


Lys 








125 










130 










135 


Pro 


Val 


Trp 


Gly Lys 


Thr 


Val 


Ser 


He 


Phe 


He 


Trp 


Phe 


Phe 


Trp 








140 










145 










150 


Phe 


Phe 


He 


Ser 


Leu 


Pro 


Asn 


Met 


He 


Leu 


Ser 


Asn 


Lys 


Glu 


Ala 








155 










160 










165 


Thr 


Pro 


Ser 


Ser 


Val 


Lys 


Lys 


Cys 


Ala 


Ser 


Leu 


Lys 


Gly 


Pro 


Leu 










170 








175 










180 


Gly 


Leu 


Lys 


Trp 


His 


Gin 


Met 


Val 


Asn 


Asn 


He 


Cys 


Gin 


Phe 


He 




185 










190 










195 


Phe 


Trp 


Thr 


Val 


Phe 


He 


Leu 


Met 


Leu 


Val 


Phe 


Tyr 


Val 


Val 


He 








200 










205 










210 


Ala 


Lys 


Lys 


Tyr 


Met 


He 


Leu 


He 


Glu 


Ser 


Pro 


Lys 


Val 


Arg 


Thr 




215 










220 










225 


Glu 


Lys 


Thr 


Thr 


Lys 


Ser 


Trp 


Lys 


Ala 


Lys 


Tyr 


Leu 


Leu 


Ser 


Trp 











230 










O IT 

235 












Leu 


Ser 


Ser 


Leu 


Cys 
245 


Val 


Leu 


Leu 


His 


Phe 
250 


He 


Ser 


Pro 


Glu 


Phe 

o c c 


His 


He 


Leu 


Thr 


Val 


Lys 


Pro 


Thr 


He 


Arg 


Leu 


Thr 


Val 


Asp 


Cys 










260 








265 










270 


Lys 


He 


Asn 


Cys 


Leu 


Leu 


Leu 


Lys 


Lys 


Gin 


Leu 


Ser 


Phe 


Trp 


Gin 






275 










280 










285 


Gin 


Leu 


Thr 


Phe 


Val 
290 


Trp 


He 


Pro 

















(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1301 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TTTTGGGTAT TTCTGAGAAA AAGGAAATAT TTATAAAACC ATCCAAAGAT CCAGATAATT 60 

TGCAAATAAA TTGGAGGTTA TAGAGGTTAT AATCTGAATC CCAAAGGAGA CTGCAGCTGA 120 

TGAAAGTGCT TCCAAACTGA AAATTGGACG TGCCTTTACG ATGGTAAGCG TTAACAGCTC 180 

CCACTGCTTC TATAATGACT CCTTTAAGTA CACTTTGTAT GGGTGCATGT TCAGCATGGT 240 

GTTTGTGCTT GGGTTAATAT CCAATTGTGT TGCCATATAC ATTTTCATCT GCGTCCTCAA 300 

AGTCCGAAAT GAAACTACAA CTTACATGAT TAACTTGGCA ATGTCAGACT TGCTTTTTGT 360 

TTTTACTTTA CCCTTCAGGA TTTTTTACTT CACAACACGG AATTGGCCAT TTGGAGATTT 420 

ACTTTGTAAG ATTTCTGTGA TGCTGTTTTA TACCAACATG TACGGAAGCA TTCTGTTCTT 480 

AACCTGTATT AGTGTAGATC GATTTCTGGC AATTGTCTAC CCATTTAAGT CAAAGACTCT 54 0 

AAGAACCAAA AGAAATGCAA AGATTGTTTG ACATGGCGTG TGGTTAACTG TGATCGGAGG 60 0 

AAGTGCACCC GCCGTTTTTG TTCAGTCTAC CCACTCTCAG GGTAACAATG CCTCAGAAGC 660 

CTGCTTTGAA AATTTTCCAG AAGCCACATG GAAAACATAT CTCTCAAGGA TTGTAATTTT 72 0 

CATCGAAATA GTGGGATTTT TTATTCCTCT AATTTTAAAT GTAACTTGTT CTAGTATGGT 780 

GCTAAAAACT TTAACCAAAC CTGTTACATT AAGTAGAAGC AAAATAAACA AAACTAAGGT 84 0 

TTTAAAAATG ATTTTTGTAC ATTTGATCAT ATTCTGTTTC TGTTTTGTTC CTTACAATAT 900 

CAATCTTATT TTATATTCTC TTGTGAGAAC ACAAACATTT GTTAATTGCT CAGTAGTGGC 960 

AGCAGTAAGG ACAATGTACC CAATCACTCT CTGTATTGCT GTTTCCAACT GTTGTTTTGA 102 0 

CCCTATAGTT TACTACTTTA CATCGGACAC AATTCAGAAT TCAATAAAAA TGAAAAACTG 1080 

GTCTGTCAGG AGAAGTGACT TCAGATTCTC TGAAGTTCAT GGTGCAGAGA ATTTTATTCA 114 0 

GCATAACCTA CAGACCTTAA AAAGTAAGAT ATTTGACAAT GAATCTGCTG CCTGAAATAA 1200 

AACCATTAGG ACTCACTGGG ACAGAACTTT CAAGTTCCTT CAACTGTGAA AAGTGTCTTT 1260 

TTGGACAAAC TATTTTTCCA CCTCCAAAAG AAATTAACAC A 1301 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 344 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS: 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Val Ser Val Asn Ser Ser His Cys Phe Tyr Asn Asp Ser Phe 
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5 10 15 

Lvs Tyr Thr Leu Tyr Gly Cys Met Phe Ser Met Val Phe Val Leu 

20 25 30 

Gly Leu He Ser Asn Cys Val Ala He Tyr He Phe He Cys Val 

35 40 45 

Leu Lvs Val Arg Asn Glu Thr Thr Thr Tyr Met He Asn Leu Ala 

50 55 60 

Met Ser Asp Leu Leu Phe Val Phe Thr Leu Pro Phe Arg He Phe 

65 70 75 

Tvr Phe Thr Thr Arg Asn Trp Pro Phe Gly Asp Leu Leu Cys Lys 

80 85 90 

He Ser Val Met Leu Phe Tyr Thr Asn Met Tyr Gly Ser He Leu 

95 100 105 

Phe Leu Thr Cys He Ser Val Asp Arg Phe Leu Ala He Val Tyr 

110 115 120 

Pro Phe Lys Ser Lys Thr Leu Arg Thr Lys Arg Asn Ala Lys He 

125 130 135 

Val Cys Thr Gly Val Trp Leu Thr Val He Gly Gly Ser Ala Pro 

140 145 150 

Ala Val Phe Val Gin Ser Thr His Ser Gin Gly Asn Asn Ala Ser 

155 160 165 

Glu Ala Cys Phe Glu Asn Phe Pro Glu Ala Thr Trp Lys Thr Tyr 

170 175 180 

Leu Ser Arg He Val He Phe He Glu He Val Gly Phe Phe He 

185 190 195 

Pro Leu He Leu Asn Val Thr Cys Ser Ser Met Val Leu Lys Thr 

200 205 210 

Leu Thr Lys Pro Val Thr Leu Ser Arg Ser Lys He Asn Lys Thr 

215 220 225 

Lvs Val Leu Lys Met He Phe Val His Leu He He Phe Cys Phe 
^ 230 235 240 

CVS Phe Val Pro Tyr Asn He Asn Leu He Leu Tyr Ser Leu Val 

245 250 255 

Arg Thr Gin Thr Phe Val Asn Cys Ser Val Val Ala Ala Val Arg 
^ 265 270 



260 



Thr Met Tyr Pro He Thr Leu Cys He Ala Val Ser Asn Cys Cys 

275 280 285 

Phe Asp Pro He Val Tyr Tyr Phe Thr Ser Asp Thr He Gin Asn 

290 295 300 

Ser He Lys Met Lys Asn Trp Ser Val Arg Arg Ser Asp Phe Arg 

305 310 315 

Phe Ser Glu Val His Gly Ala Glu Asn Phe He Gin His Asn Leu 

320 325 330 

Gin Thr Leu Lys Ser Lys He Phe Asp Asn Glu Ser Ala Ala 

335 340 

(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 
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(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GACTAAAGCT TAATGAGTAG TGAAATGGTG 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

GAACTTCTAG ACCCTCAGGG TTGTAAATCA G 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

GACTAAAGCT TAATGAGGCC CACATGGGCA 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 32 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

GAACTTCTAG ACGAACTAGT GGATCCCCCC GG 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

( D ) TOPOLOGY : L INEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

GACTAAAGCT TAATGGCGTC TTTCTCTGCT 
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(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GAACTTCTAG ACTTCACACA GTTGTACTAT 30 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GACTAAAGCT TAATGGTAAG CGTTAACAGC 3 0 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GAACTTCTAG ACTTCAGGCA GCAGATTCAT T 31 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 34 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GTCCAAGCTT GCCACCATGA GTAGTGAAAT GGTG 34 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 58 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 
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(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CTAGCTCGAG TCAAGCGTAG TCTGGGACGT CGTATGGGTA GCAGGGTTGT AAATCAGG 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 34 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GTCCAAGCTT GCCACCATGG TTGGTGGCAC CTGG 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 58 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CTAGCTCGAG TCAAGCGTAG TCTGGGACGT CGTATGGGTA GCAGTGGATC CCCCGTGC 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 34 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GTCCAAGCTT GCCACCATGA ACACCACAGT AATG 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 61 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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CTAGCTCGAG TCAAGCGTAG TCTGGGACGT CGTATGGGTA GCAAGGGATC CATACAAATG 60 
T 61 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 34 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GTCCAAGCTT GCCACCATGG TAAGCGTTAA CAGC 34 

(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 61 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

CTAGCTCGAG TCAAGCGTAG TCTGGGACGT CGTATGGGTA GCAGGCAGCA GATTCATTGT 60 
C 61 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CGGGATCCCT CCATGAGTAG TGAAATGGTG 30 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 2 9 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2&: 

CGGGATCCCG CTCAGGGTTG TAAATCAGG 29 
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